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ABSTRACT 

We present a new catalogue, the Imperial IRAS-FSC Redshift Catalogue (IIFSCz), of 
60,303 galaxies selected at 60 /mi from the IRAS Faint Source Catalogue (FSC). The 
IIFSCz consists of accurate position, optical, near-infrared and/or radio identifications, 
spectroscopic redshift (if available) or photometric redshift (if possible), predicted 
far-infrared (FIR) and submillimetre (submm) fluxes ranging from 12 to 1380 fim. 
based upon the best-fit infrared template. About 55% of the galaxies in the IIFSCz 
have spectroscopic redshifts and a further 20% have photometric redshifts obtained 
through either the training set or the template-fitting method. For S(60) > 0.36 Jy, 
the 90% completeness limit of the FSC, 90% of the sources have either spectroscopic 
or photometric redshifts. Scientific applications of the IIFSCz include validation of 
current and forthcoming infrared and submm/mm surveys such as AKARI, Planck and 
Herschel, follow-up studies of rare source populations, large-scale structure and galaxy 
bias, local multiwavelength luminosity functions and source counts. The catalogue is 
publicly available from |http: / / astro.imperial.ac.uk/~mrr/fes/| 

Key words: catalogues - surveys - galaxies: distances and redshifts -infrared: galax- 
ies - quasars: general - large-scale structure of Universe. 



1 INTRODUCTION 

The IRAS Faint Source Catalog (FSC; Mo-shir et al. 1992) 
contains 173,044 sources extracted from image plates of co- 
added data. It is 2-2.5 times deeper than the IRAS Point 
Source Catalogue (PSC), reaching a depth of ~0.2 Jy at 12, 
25 and 60 /im. The sky coverage of the FSC is limited to 
|6| > 20° in unconfused regions at 60 /im. For sources with 
high-quality flux densitjQ, the minimum reliability of the 
whole catalogue is ~ 99% at 12 and 25 /im and ~ 94% at 
60 /im. Around 41% of the sources are detected at 60 /im 
(FQUAL = 2 or 3 at 60 /im). 

The construction of a redshift catalogue of the FSC 
60 /im sources is made possible by overlaps (in terms of 
depth and area) with various surveys either spectroscopic or 
photometric, such as the Sloan Digital Sky Survey (SDSS; 
York et al. 2000), the Two Micron All Sky Survey (2MASS; 
Skrutskie et al. 1997) and the 6dF Galaxy Survey (Jones et 
al. 2004; Jones et al. 2005). We use two photometric red- 
shift techniques, the empirical training set method and the 
Spectral Energy Distributions (SED) fitting procedure, to 
provide estimates of redshifts from optical and near-infrared 
(NIR) broad-band photometry. 



* E-mail: lingyu.wang05@imperial.ac.uk 

1 In the IRAS FSC, the flux density quality (FQUAL) is classified 
as high (=3), moderate (=2) or upper limit (=1) 



Other recent or planned all-sky surveys include AKARI 
and Planck. The AKARI (previously known as ASTRO-F) 
All-Sky Survey, which ended on August 26th, 2007, has ob- 
served 94% of the sky from mid- to far-infrared (Murakami 
et al. 2007) . The 5a point source detection limit of the Far- 
Infrared Surveyor with a single scan coverage is estimated 
to be 2.4, 0.55, 1.4 and 6.3 Jy at 65, 90, 140 and 160 mi- 
cron respectively (Kawada et al. 2007; Wang et al. 2008). 
The ESA (European Space Agency) mission Planck is go- 
ing to map the Cosmic Microwave Background (CMB) with 
unprecedented angular resolution and sensitivity as well as 
produce all-sky catalogues of infrared and radio galaxies in 
the frequency bands ranging from 30 to 850 GHz. The study 
of these Planck extragalactic point sources can not only help 
clean up the foreground contaminants in the CMB images 
but also constrain galaxy formation and evolution models 
and allow searches for high redshift dusty galaxies. For the 
HFI channels, the estimated Planck All Sky Survey sensi- 
tivity at the 3<r level is 26, 37, 75, 180, 300 mjy at 2100, 
1380, 850, 550, 350 /im respectively (Planck Bluebook, ESA- 
SCI(2005)-1, Version 2) and thus many of the sources de- 
tected by Planck will also be present in the IRAS catalogues. 
Another ESA mission Herschel (Pilbratt 2004), which is to 
be launched together with Planck, will observe the universe 
in the far-infrared (FIR) and submillimetre (submm) range 
(approximately 57 - 670 /im) with the formation and evo- 
lution of galaxies and stars and stellar systems as its key 
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science goals. The SPIRE instrument (Griffin et al. 2007) 
will perform imaging in the broadband photometry mode 
centred at 250, 350 and 500 /jm with the predicted point 
source sensitivity in the range 8-11 mjy (5<r, 1 hr). To 
aid these missions, we have provided predicted fluxes at the 
relevant mission wavelengths. The Imperial IRAS-FSC Red- 
shift Catalogue (IIFSCz) will be an important input and 
validation catalogue for current and forthcoming wide-area 
infrared/submm surveys described above. 

There are also several extragalactic science programmes 
that can be carried out with the IIFSCz. The convergence 
of the cosmological dipole is particularly worth further in- 
vestigation as previous measurements with all-sky surveys 
such as IRAS or 2MASS have not yet shown a consensus on 
the convergence depth (Rowan- Robinson et al. 2000; Mailer 
et al. 2003; Erdogdu et al. 2006). While 2MASS samples 
the local universe (within 200 hT 1 Mpc) very well, it is not 
quite deep enough to detect potential contributions to the 
dipole signal from large distance. The IIFSCz provides a 
huge sample for large-scale structure and velocity studies, 
e.g. the baryon acoustic oscillation (Eisenstein et al. 2005), 
the two-point correlation function and its dependence on the 
FIR luminosity and star formation rate (Mann, Saunders & 
Taylor 1996; Szapudi et al. 2000; Hawkins et al. 2001), the 
local multiwavelength luminosity functions (Serjeant & Har- 
rison 2005), follow-up studies of rare source populations such 
as ultraluminous infrared galaxies (ULIRGs) and hyperlu- 
minous infrared galaxies (HLIRGs) and number counts. 

The layout of this paper is as follows. The criteria used 
to select galaxies from the IRAS FSC are described in Sec- 
tion 12.11 The issue of extended sources is discussed in Sec- 
tion 12.21 In Section [3] we firstly obtain spectroscopic red- 
shifts from a number of databases and then cross-identify 
(using the likelihood ratio technique) FSC sources with their 
optical, near-infrared and/or radio counterparts. In Sec- 
tion |4l we carry out photometric redshift estimation (us- 
ing both the training set and the template-fitting method) 
and then make flux predictions at FIR and submm wave- 
lengths. The overall properties of the IIFSCz is described in 
Section [5] Finally, discussions and conclusions are given in 
Section[6]and Section[7]respectively. We adopt a flat cosmo- 
logical model with A = 0.7 and ho = 0.72. 



2 CATALOGUE CONSTRUCTION 
2.1 Sample selection 

To obtain a complete sample of galaxies from the IRAS FSC, 
our selection criteria are: 

(1) To ensure reliability, we select sources with FQUAL 
> 3 and SNR > 5 at 60 /jm. There are two types of signal- 
to-noise ratio in the IRAS FSC, the SNR at a given pixel 
(LOCSNR) and the SNR which uses a noise value derived 
from a local region. The latter is adopted here. This condi- 
tion leaves us with 63,842 sources; 

(2) To exclude cirrus, we require log(S100/S60)< 0.8 
if FQUAL ^ 2 at 100 /im. The sample size is reduced to 
63,117. The upper panel in Fig. [2]shows the predicted 100- 
60 ^im colour as a function of redshift based upon four in- 
frared templates, cirrus, M82 starburst, Arp 220 starburst 
and AGN dust torus (Rowan-Robinson et al. 2004; Rowan- 
Robinson et al. 2008). It is clear that our constraint on the 
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Figure 1. Flux densities at 60 /an vs. i-G-band total magnitude 
diameter for around 28,000 FSC sources with 2MASS identifi- 
cations. The red symbols are the FSC sources associated with 
extended PSCz sources (i.e. D25 > 2.25'). The solid line shows 
the mean K s total magnitude diameter Dxtot = 2.57' for the red 
symbols and the dashed line is where S60 = 0.4 Jy. 

60-100 )im flux ratio should not exclude any infrared galaxy 
type for z < 4; 

(3) To discriminate against stars, we firstly require 
log(S60/S25)> -0.3 if FQUAL ^ 2 at 25 ^m and then 
log(S60/S12)> if FQUAL ^2 at 12 /im. A total of 60,381 
sources have met the above criteria, the faintest of which has 
a flux density of 0.12 Jy at 60 pm. Fig.0is the colour-colour 
diagram of FSC sources with detections at 12 and 25 )im 
after applying the 60-100 fim colour cut. The Rayleigh- 
Jeans predictions for the 12-60, 25-60, 25-12 /im flux ra- 
tio are the dotted lines, while our stellar rejection criteria 
are indicated by the solid lines. The concentrations of ob- 
jects to the upper right are stars. Admittedly, our colour 
cuts might be a little harsh and therefore some low-redshift 
AGNs might be missing from our sample (see the middle and 
lower panel in Fig[5J. By examining DSS (Digitized Sky Sur- 
vey) images and cross-identification in NED or Simbad (if 
available) of sources in regions where < log(S12/S60)< 
0.7 and log(S25/S60)> or < log(S12/S60)< 0.3 and 
log(S25/S60)< 0.3, we managed to retrieve 3 Seyfert galax- 
ies, 8 unidentified or confused galaxies, while the rest are 
mostly carbon stars, post-AGB stars etc. At this stage, our 
sample contains 60, 392 galaxies which forms the base cata- 
logue of the IIFSCz. However, we point out that the sample 
size will undergo one more change in Section 13.11 



2.2 Extended sources 

In the IRAS Faint Source Survey, the median filtering 
method was used to maximise detection of faint point 
sources. However, the fluxes of extended sources are at- 
tenuated. This may cause nearby galaxies to be incor- 
rectly excluded from our catalogue. In the construction 
of the PSCz catalogue, Saunders et al. (2000) used the 
ADDSCAN/SCANPI to derive fluxes for sources with blue- 
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Figure 2. Upper: Colour at 100-60 /im vs. spectroscopic rcdshift 
colour-coded by four infrared templates (black: cirrus; red: M82; 
green: A220; blue: AGN dust torus). Middle: Colour at 25-60 /im 
vs. spectroscopic redshift. Lower: Colour at 12-60 /im vs. spectro- 



light isophotal major diameters D25 > 2.25'. For sources 
with D25 > 8', they used fluxes from the catalogue of IRAS 
observations of 85 large optical galaxies (Rice et al. 1988). 
There are a total of 1402 galaxies with blue isophotal diam- 
eters > 2.25' in the PSCz, 1290 of which are associated with 
PSC sources. Of the 1290 PSC sources, 938 were found in 
the FSC after applying our first selection criterion and so 
we have adopted their PSCz fluxes. Of the remaining 112 
sources not found in the PSC, we identified 76 in the FSC 
and their fluxes were also replaced by PSCz fluxes. Some 
of the extended PSCz sources were not found in the FSC 
because they lie at \b\ < 20°. 

ADDSCAN was designed for accurate flux measure- 
ment of bright sources and extended sources which need 
higher in-scan resolution. From a sample of 62 unambigu- 
ously extended galaxies in the Virgo Cluster, the ratio of 
the addscan flux to the FSC flux stays close to unity for 
sources fainter than ~0.4 Jy at 60 /jm (Moshir et al. 1992). 
Indeed, almost all FSC sources cross-matched with extended 
PSCz sources (D25 > 2.25') have 560 > 0.4 Jy. In order to 
estimate the likely number of FSC sources to have their 
fluxes seriously underestimated, we show the A^-band total 
magnitude diameter (Dxtot) for around 28,000 FSC sources 
with 2MASS identifications (see Section 3.3) in Fig. \T\ The 
mean K s total magnitude diameter for FSC sources associ- 
ated with extended PSCz sources (red symbols in Fig. [1} 
is D K tot = 2.57' ± 0.90'. Therefore, the K s total mag- 
nitude diameter Dpctot is approximately the same as the 
blue isophotal diameter D25 (Jarrett et al. 2003). In to- 
tal, there are 1677, 430, 97 galaxies with S60 > 0.4 Jy 
and DKtot > 1.7', 2.6', 3.5' respectively. Excluding extended 
sources whose fluxes have already been replaced by PSCz 
fluxes, we conclude there is still <~ 1% of the galaxies in 
our catalogue suffer from flux underestimation. 



3 SOURCE IDENTIFICATION 

3.1 Redshift compilation from NED, FSSz, PSCz 
& 6dF 

Having constructed the base catalogue, the next step is to 
obtain spectroscopic redshifts from past redshift surveys and 
the literature. The NASA/IPAC Extragalactic Database 
(NED) currently contains 10.4 million objects, 1.4 million 
redshifts and 16.0 million cross-identifications based upon 
astrometry, photometry and avoidance of confusion. 

An NED all-sky querj0 of FSC sources at \b\ > 20° 
with S60>0.1 Jy returned 64,219 objects (we will refer to 
this sample as NED-FSC). In the NED, two constraints 
were used to select FSC sources, log(S60/S25)> —0.3 and 
FQUAL^ 3 at 60 [im. Therefore, our sample is expected to 
be smaller than the NED-FSC. The NED position of a given 
FSC source is the best possible position, that is to say opti- 
cal or near-infrared position if available and IRAS position 
if not. The accuracy of IRAS position depends on the size, 
brightness and spectral energy distribution of the source but 



2 An all-sky query in NED usually returns a list of sources with 
information such as source name, type, position and redshift. 
However, photometric data for each source is not included and 
has to be retrieved individually. 
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Figure 3. Colour-colour distribution of sources detected at 12 and 25 /an after applying the cirrus rejection criterion. The dotted lines 
are the Rayleigh-Jeans predictions, while the solid lines indicate our stellar rejection criteria. Sources with log 10(S25/S60)< 0.3 and 
log 10(S12/S60)< are selected as galaxies. 



is usually <20" (1-cr). Within a radius of 300", 60,320 out 
of 60,392 sources in our base catalogue are matched with 
sources in the NED-FSC, more than 23,000 of which have 
spectroscopic redshifts in the NED. A manual checking of 
the matched sources with separations ^80" (~170 objects) 
proved that these cross-identifications are correct. Fig. [4] 
shows the number of cross-identifications as a function of 
the angular separation between the IRAS position and the 
NED position. 3 FSC sources are matched manually to their 
cross-identifications due to their large positional separations. 
44 FSC sources are not found in NED. In addition, we have 
removed a few FSC sources which are identified in NED as 
galactic objects from our base catalogue. This is the final 
change to the size of our base catalogue which now contains 
60,303 galaxies. 

The FSS redshift survey (FSSz; Oliver, PhD thesis) cov- 
ering 700 deg 2 measured redshifts for 1,546 FSC sources. It 
provides an additional 568 redshifts for our catalogue. 

So far, around 2,000 bright FSC sources (S60>0.6 Jy) 
in our base catalogue do not have spectroscopic redshifts. 
The PSC Redshift Survey (PSCz; Saunders et al. 2000) is 
complete to S60=0.6 Jy. NED has not folded in the PSCz 
due to issues such as significant positional difference and 
incorrect NGC/IC/MCG cross-identifications. We managed 
to get 1,972 spectroscopic redshifts from the PSCz via exact 
source name matching. 

The 6dF Galaxy Survey covers roughly 2/3 of the south- 
ern sky, measuring redshifts for over 80,000 galaxies. The 
primary redshift sample is selected from the 2MASS Ex- 
tended Source Catalog (XSC), together with 13 other sam- 
ples such as the SuperCOSMOS catalogue, the ROSAT All- 
Sky Survey and the IRAS FSC. NED used a 3" radius in 
matching the 6dF with the FSC, however, matches were 
not made if there were more than one 6dF object within 



the IRAS beam. In total, NED has matched ~ 6000 FSC 
sources with the 6dF observations. 

In the 6dF database, 10111 objects appear in both the 
IRAS FSC and the SPECTRA table where all the obser- 
vational and redshift related information are stored. Again, 
by matching FSC source names, the 6dF provides an ex- 
tra 2,794 redshifts with acceptable quality (on quality scale 
Q=3 or 4). 

To summarise, we have obtained 29,037 spectroscopic 
redshifts from NED, FSSz, PSCz and 6dF, which comprises 
48% of our base catalogue. However, there are a few thou- 
sand more redshifts to be gained from the SDSS DR6 survey 
(see below). 

3.2 CROSS-MATCH WITH SDSS 

The SDSS photometric DR6 survey has covered 9,583 deg 2 
at u,g,r,i,z with magnitude limits 22.0, 22.2, 22.2, 21.3, 
20.5 respectively. The spectroscopic DR6 survey has covered 
7,425 deg 2 , the main samples of which are magnitude-limited 
at Petrosian r< 17.77 for galaxies. 

As mentioned in Section 13.11 one does not get photo- 
metric data using the NED all-sky search. Therefore, in or- 
der to get optical magnitudes, we have to cross-identify FSC 
sources with their optical counterparts in the SDSS DR6 cat- 
alogues. In addition, because NED has only entered galaxies 
from the spectroscopic DR5 survey (covering 5,740 deg 2 ) so 
far, we can also get new redshifts from the spectroscopic 
DR6 survey. 

3.2.1 The likelihood ratio technique 

We use the likelihood ratio technique (LR; Wolstencroft et 
al. 1986; Sutherland & Saunders 1992; Ciliegi et al. 2003; 
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Figure 4. Distribution of the angular separations between the 
IRAS positions and the NED positions of the FSC sources. 

Brusa et al. 2007) to cross-identify FSC sources with their 
optical counterparts. The LR essentially compares the prob- 
ability of a candidate being the true counterpart as a func- 
tion of magnitude m and separation d with that of a chance 
association, i.e. 

^ p(m,d) dm dx dy q(m)f(d) 
n(m) dm dx dy n(m) 

The probability distribution function f(d) is usually as- 
sumed to be a two-dimensional Gaussian, 

f(d) = -^-«p[-i(4+4)i 

= ^^exp(-id 2 ), (2) 

where o\ and 02 are the axes of the positional uncertainty el- 
lipse of a given FSC source, d\ and are the positional sep- 
aration along the axes and d is the normalised angular dis- 
tance. For FSC sources, the mean 1-a positional uncertainty 
along the minor and major axes are 5 and 18 arcsec respec- 
tively. The magnitude distribution function of background 
objects n(m) is taken from objects around some random 
positions in the sky. The magnitude distribution function of 
the true optical counterparts q(m) is obtained by subtract- 
ing n(m) from the magnitude distribution of objects each 
FSC sources. In Bayesian inference, it can be shown that L 
contains all the information about the probability of a can- 
didate being the true counterpart. In cases where there are 
multiple candidates, the one with the highest likelihood is 
selected as the true counterpart. 

3.2.2 Obtaining redshifts from SDSS DR6 

Firstly, we need to test the reliability of the LR method. 
This is easily achievable as we have already obtained 29,022 
spectroscopic redshifts in Section [3.11 which can be used to 
compare with the redshifts of the optical counterparts found 
in the spectroscopic DR6 survey. 

In the spectroscopic DR6 catalogue, we search for all 



primary object^] within 1 arcmin from the IRAS position of 
each FSC source with known spectroscopic redshift gained 
in Section 13.11 Although in principle we can use accurate 
positions for these FSC sources, the IRAS positions are used 
to investigate the impact of large positional uncertainty. 

Of the 29,022 FSC sources with spectroscopic redshifts, 
8,979 were found to have optical counterparts and, of these, 
an average of 1.2 optical counterparts were found for each. 
We exclude candidates with r > 17.6 (above the magnitude 
limit) and z < 0.0005 in order to exclude stars. Fig.[5]shows 
the r-band magnitude distribution of the true counterparts 
and that of the random background objects. The thick solid 
line is a Gaussian fit to g(m), 

1 \ 1 1/^-15.31x2, ,„s 
q(m) oc exp[--( — )] (3) 

and the thin solid curve is an exponential model of n(m). 
Using the LR method, we managed to cross-match 8166 FSC 
sources with optical galaxies in the spectroscopic DR6 sur- 
vey. For an FSC source, if its redshift obtained in Section [3~T1 
agrees with the redshift of its optical counterpart, then we 
say that the cross-identification is correct. Thus, the reliabil- 
ity, defined as the ratio of the number of correction identifi- 
cations to the total number of sources in the cross-matched 
sample, 

Reliability = °° rrect , (4) 

"total 

of the cross-matched sample is estimated to be ~ 97%. 

Now we can apply the same procedure to the 31,260 
FSC sources which did not receive spectroscopic redshifts in 
Section 13.11 Using a search radius of 1', 4654 FSC sources 
are matched with 5462 optical candidates. The dashed curve 
in Fig. [5] is a Gaussian fit to the new q(m), where the peak 
is now shifted to ~16.8. The LR technique gives rise to new 
optical identifications and spectroscopic redshifts for 3,844 
FSC sources. 

3.2.3 Obtaining photometry from SDSS DR6 

Source identification becomes much more complicated in 
matching FSC sources with the photometric DR6 catalogue 
as the probability of chance association is oc rnrd 2 , where n 
is the number density of background optical objects. Using a 
search radius of 1', an average of 28 optical counterpart can- 
didates were found for each FSC source. The LR approach 
produced 9,425 cross-identifications between the sample of 
31,260 FSC sources and the photometric DR6 catalogue. 
The adopted g(m) is the blue dotted line in Fig. [S] SDSS 
objects classified as stellar are rejected not only because the 
contamination caused by stars is significant but also because 
the vast majority of the QSOs already have redshifts. 

To estimate the reliability of our cross-matched sample, 
firstly let F denote the set of 9,425 FSC sources and O the 
set of optical counterparts. The reliability of O is simply 
the ratio of the number of true optical ids in O to the size 
of O. Consider a subset T C F for which the true optical 
counterparts T are known. If the set of optical counterparts 

3 Whenever the SDSS makes multiple observations of the same 
object, the one with the best photometry will be assigned as the 
'primary' observation. 
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Figure 5. The r magnitude distribution of objects in the SDSS spectroscopic DR6 around FSC sources with redshifts obtained in 
Section |3, II (black dotted line). The triangles show the magnitude distribution of random background objects, i.e. n(m), and the thin 
solid curve is an exponential fit to these points. The asterisks depict the magnitude distribution of the true optical counterparts, i.e. 
q(m), and the thick solid line is a Gaussian fit to these points. The dashed curve is a Gaussian fit to q(m) for optical objects around 
FSC sources which do not gain redshifts in Section l3.ll The vertical line is where r=17.6. 



of T found using the LR method is denoted O C O, the 
reliability of O is given by the ratio 

Reliability = (5) 
No 

where the numerator is the size of the intersection between 
O and T and the denominator is the size of O. Assuming the 
subset T is representative of F, the reliability of the subset 

should reflect the reliability of O. 

In order to get reliable optical identifications, accurate 
positions of the FSC sources are needed so as to reduce the 
probability of chance association. In the following, three test 
samples are used: 

• We select 5,000 FSC sources with 2MASS identifica- 
tions, the positional uncertainty of which is around 1.25" at 
the 95% confidence level. Using a search radius of 1" around 
the 2MASS positions, 1,325 FSC sources (out of 5,000) were 
matched with unique optical counterparts in the photomet- 
ric DR6 survey (we will refer to this set of optical ids as 
'2MASS_larcsec'). Using '2MASS_larcsec' as T, the relia- 
bility of O is estimated to be 83%. Excluding 'stellar' optical 
objects, the reliability is increased to 88%. 

• The Faint Images of the Radio Sky at Twenty centime- 
tres survey (FIRST; Becker et al. 2004) with a sensitivity of 

1 mjy at 1.4 GHz and an angular resolution of 5" yielded 
a catalogue of ~811,000 sources. The survey area overlaps 
with that of the SDSS and the radius of the 90% confidence 
error circle is less than 1". An average of 1.1 FIRST sources 
were found within 1' from an FSC source. We have se- 
lected 4,886 unambiguous FSC-FIRST cross-matches, 3,273 
of which have unique optical counterparts in the photomet- 



ric DR6 survey ('FSC-FIRST-SDSS')- Using 'FSC-FIRST- 
SDSS' as T, the reliability of O is 80%. 

• The sample of 3844 new optical identifications obtained 
in Section [3. 2. 21 serves as the third test sample and it gives 
us a reliability of 76%. 

Thus, on average the optical identifications obtained 
from the SDSS photometric DR6 are ~ 80% reliable. 

3.3 CROSS-MATCH WITH 2MASS 

The 2MASS has uniformly surveyed the whole sky in J, H 
and K s , reaching a median depth of z = 0.073. To date, 
the 2MASS XSC has been comprehensively assimilated and 
cross-matched with other surveys in NED. Therefore, it is 
fairly straightforward to retrieve 28,640 cross-ids between 
the IIFSCz and the 2MASS XSC, around 7,000 of which 
do not have redshift information. We have also identified 39 
FSC sources in the 2MASS Point Source Catalogue (PSC), 
25 of which are shown to be QSOs. 

3.4 CROSS-MATCH WITH NVSS 

The NRAO VLA Sky Survey (NVSS; Condon et al. 1998) is 
a moderately deep radio survey over 82% of the sky. It has 
produced a catalogue of nearly 2 million sources brighter 
than 2.5 mjy at 1.4 GHz. The observed extragalactic sources 
include nearby normal galaxies, AGNs, star-forming galax- 
ies, starbursts etc. The rms positional uncertainties are ^ 1" 
for sources brighter than 15 mjy and 7" for sources above 
the detection limit. Given the almost linear FIR to radio lu- 
minosity correlation (Helou, Soifer & Rowan- Robinson 1985; 
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de Jong et al. 1985; Yun, Reddy & Condon 2001; Appleton 
et al. 2004), the NVSS is expected to detect most of the FSC 
sources and therefore provide much more accurate positions 
for these sources. 

Unique radio counterparts were found for 23,183 FSC 
sources within a radius of 1' and multiple counterparts were 
found in ~440 cases. Thus the cross-identification accuracy 
is ^ 98% even if we adopt a crude nearest-object cross- 
matching approach. The NVSS provides cross-ids for 3123 
FSC sources without any observation other than IRAS. 



4 PHOTOMETRIC REDSHIFT ESTIMATION 

4.1 THE TRAINING SET METHOD 

We use two different techniques, the empirical training set 
and the template-fitting technique, to estimate photometric 
redshifts for FSC sources with optical, NIR and/or radio 
photometry. In this section, we apply the public Artificial 
Neural Networks code of Firth, Lahav & Somerville 2003; 
Collister & Lahav 2004 (ANNz) to the IIFSCz and results 
are presented and discussed. 

ANNz requires a representative training selQ to learn 
the functional relationship between photometry and red- 
shift. Advantages of the training set method include nulli- 
fying systematic effects, freedom and flexibility in choosing 
input parameters, greater accuracy and efficiency etc. Hav- 
ing said that, the photometric redshift accuracy strongly 
depends on the quality of the training set. In principle, the 
training set should occupy the same region in the parameter 
space (colour, redshift, spectral type etc.) as the testing set. 

The structure of ANNz can be roughly divided into 
three passages, the input layer, the intervening/hidden lay- 
ers and the output layer. In our case, it can be viewed as: 
magnitudes at various wavebands — > functional mapping — > 
photometric redshift. Each layer consists of a number of 
nodes. The network architechture is determined by the num- 
ber of filters at the input layer, the number and size of each of 
the hidden layers and the number of outputs. For example, a 
3.10.10.10.1 network architecture has photomeric measure- 
ments in three filters, three hidden layers each of which has 
ten nodes, and one output, i.e. the derived photometric red- 
shift. The optimisation of the mapping is achieved by tuning 
the network weights associated with connected nodes. The 
training process is terminated when the cost function, de- 
fined as (z p hot — z spec ) 2 , is minimal on the validation set. 
ANNz gives two types of errors, photometric noise and net- 
work variance. The latter is obtained by using a number of 
networks known as a committee. 

For FSC sources with 2MASS counterparts, we choose 
the elliptical isophotal aperture based on the K s 20 
mag/arcsec 2 isophote which gives a good estimate of the in- 
tegrated flux and colour. The 2MASS training set contains 
21050 galaxies, 6664 of which are separated out to form a val- 
idation set. The 2MASS testing set contains 6771 galaxies. 
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Figure 6. Colour-colour distribution of FSC sources with 2MASS 
(upper) or SDSS cross- ids (lower). 



For FSC sources with SDSS counterparts, the SDSS model 
magnitudes obtained through the best-fit model (either a 
pure deVaucouleurs or an exponential) in the r band is a 
good choice for accurate colours. The SDSS training set con- 
tains 10096 galaxies, the testing set contains 5586 galaxies 
and the validation set contains 1856 galaxies. Extinction cor- 
rected is applied using the full-sky 100 /J,m maps (Schlegel, 
Finkbeiner & Davis 1998). The colour-colour distribution of 
the training (black plus signs) and testing set (red plus signs 
for the 2MASS cross-ids and blue plus signs for the SDSS 
cross-ids) is shown in Fig.[S] The SDSS training set is not as 
representative as the 2MASS training set. The photometric 
redshift error defined as 



4 A training set is a collection of sources with accurate redshifts 
and the same filter set as the testing set for which we would 
like to estimate the photometric redshifts. A Validation set is a 
random collection of sources with known redshifts. However, the 
validation set is not used in the network training process. 



/ Zphot 



(6) 



is 1.5% for the 2MASS validation set and 2.3% for the SDSS 
validation' set (see Fig. [TJl . 
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Figure 7. Upper: Photometric redshift versus spectroscopic red- 
shift, using 3 near-infrared bands and a 3.10.10.10.1 network ar- 
chitecture. Lower: Photometric redshift versus spectroscopic red- 
shift, using 5 optical bands and a 5.10.10.10.1 architecture. 



4.2 THE TEMPLATE-FITTING TECHNIQUE 

Another widely-used photometric redshift estimation tech- 
nique is the template-fitting method where the observed 
fluxes are compared to those from a library of SED tem- 
plates (based upon observations or population synthesis 
models) which represent different galaxy populations. Un- 
like the training set method, a representative set of galaxies 
with known redshifts is not needed. And once the spectral 
type and redshift are determined through \ 2 minimisation, 
we can carry on to predict fluxes at other wavelengths as 
well as determining useful parameters like the extinction, 
bolometric luminosity, stellar mass etc. 

Here we apply the template-fitting method that has 
been used to construct the SWIRE Photometric Redshift 
Catalogue (Rowan-Robinson et al. 2008 and references 
therein) to the IIFSCz galaxies with either optical or near- 
infrared photometry. We use only a single pass through the 
data and use a resolution of 0.002 in logio(l + z). We use 



Figure 8. Upper: The cumulative redshift completeness as a 
function of the 60 micron flux density. The dashed line is the 
completeness curve with only spectroscopic redshifts included and 
the solid line is with both spectroscopic and photometric redshifts 
included. The vertical line is S60=0.36 Jy. Lower: Redshift his- 
togram for IIFSCz galaxies. 



6 galaxy templates (E, Sab, Sbc, Scd, Sdm and starburst) 
and 3 QSO templates, as in RR08. The resulting rms error 
in (1 + z) for galaxies is 2.7% for 2MASS sources (JHK) 
and 4.4% for SDSS sources (ugriz), and the outlier rate is 
< 0.1%. Although the rms values are slightly worse than the 
neural network method, the template method does have the 
advantage that it is able to predict redshifts which lie out- 
side the range of values in the training set. This is a definite 
issue for the SDSS sources and for these we have adopted the 
order of priority (1) spectroscopic redshift, if available, (2) 
template method redshift, (3) neural network method red- 
shift. For other sources (2MASS sources or NVSS sources), 
we have adopted (1) spectroscopic redshift, (2) neural net- 
work method redshift, (3) template method redshift. 

We are able to determine a redshift for around 74% of 
the sources in our catalogue. For these sources we use the in- 
frared template fitting method of RR08 to fit four infrared 



templates (cirrus, M82 starburst, Arp220 starburst, AGN 
dust torus) to the 12-100 /im IRAS data, using spectroscopic 
redshifts where available. Parameters of these infrared tem- 
plate fits are given in the catalogue, including the infrared 
luminosities in each component. These template fits are used 
to predict the fluxes at 12, 25, 60, 90, 100, 110, 140, 160, 
250, 350, 500, 850, 1250, and 1380 fim, which cover the sur- 
vey wavelengths of AKARI, Planck- Surveyor and Herschel. 
We do not attempt infrared template fits for sources with 
2 < 0.0003 (essentially Local Group objects), since the FSS 
fluxes for these are likely to be serious underestimates. We 
believe the combination of precise optical, radio or near in- 
frared positions, redshifts and predicted submillimetre fluxes 
will make this catalogue invaluable for future large-area far 
infrared and submillimetre surveys. 

For the remaining ~15,000 sources we do not at the mo- 
ment have optical identifications or good positions, but we 
note that most of those that fall outside the areas surveyed 
by SDSS will be similar to the SDSS sources. 



5 CATALOGUE DESCRIPTIONS 

The columns in the IIFSCz include source name (as ap- 
pears in the IRAS FSC), position and flag (1=SDSS, 
2=2MASS, 3=NVSS, 4=NED, 5=FSC and prioritised in the 
same order), IRAS fluxes and flux-quality flags, ugrizj H K 
magnitudes and errors, photometry flag (1=2MASS XSC, 
2=2MASS PSC), integrated 1.4 GHz flux density and error, 
spectroscopic redshift and flag (1=SDSS, 2=PSCz, 3=FSSz, 
4=6dF, 5=NED and prioritised in the same order), ANNz 
photometric redshift and error, template-fitting photometric 
redshift, adopted redshift, optical galaxy template type, ex- 
tinction Av from optical galaxy template fit, reduced \ 2 f° r 
galaxy template fit, absolute B magnitude, optical bolomet- 
ric luminosity, fraction of contribution at 60 fim of cirrus, 
M82, AGN and A220 infrared template, bolometric lumi- 
nosity in cirrus, M82, AGN and A220 component, infrared 
luminosity, infrared template type (=1 for cirrus galaxies, 
=2 for M82 starbursts, =3 for A220 starbursts, =4 for AGN 
dust tori, i.e. L tor > Lm&2), reduced x 2 for infrared tem- 
plate fit, predicted fluxes at 12, 25, 60, 90, 100, 110, 140, 160, 
250, 350, 500, 850, 1250 and 1380 /im, other source names 
and types. The Catalogue and description are available from 
|http:/ /astro. imperial.ac.uk/~mrr/fss/ 1 

In Fig. [8] the cumulative redshift completeness is plot- 
ted against the 60 fim flux. The vertical line shows the in- 
trinsic 90% completeness limit of the FSC, 0.36 Jy at 60 fim. 
Consequently, for research programmes such as the cosmo- 
logical dipole where sample completeness is desired, the up- 
per panel in Fig. 7 should be taken into account. The IIFSCz 
covers about 61% of the whole sky. In Fig. |9j the sky dis- 
tribution is plotted for all galaxies in the IIFSCz, galaxies 
with either spectroscopic or photometric redshift and galax- 
ies without any redshift estimate. 

To summarise, the IIFSCz contains a total of 60,303 
galaxies, 55% of which have spectroscopic redshifts from 
NED, FSSz, PSCz, 6dF and the SDSS spectroscopic DR6 
survey (see the breakdown of spectroscopic redshift sources 
in Table [JJ and 20% of which have photometric redshifts 
from either the empirical training set or the template-fitting 
method. At a flux limit of S60=0.36 Jy, more than 90% of 
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Table 1. Source of spectroscopic redshift for the IIFSCz 



Source of z spec 


Number 


Fraction 


NED 


23,703 


39.3% 


IRAS FSSz 


568 


0.9% 


IRAS PSCz 


1,972 


3.3% 


6dF Galaxy Survey 


2,794 


4.6% 


SDSS spectroscopic DR6 


3,844 


6.4% 


Total 


32,881 


55.0% 



the galaxies in the IIFSCz have either spectroscopic or pho- 
tometric redshifts. There are 344 known QSOs and only 6 
of these do not have redshift information. 



6 DISCUSSIONS 

Fig. [10] shows the distribution of infrared bolometric lumi- 
nosity, Li r , versus optical bolometric luminosity, L opt , from 
our template fitting. The figure is colour-coded by the in- 
frared template type, and only galaxies with at least two 
detected far infrared fluxes are included. Most cirrus galax- 
ies have Li r < L op t, as expected if the emission is from an 
optically thin interstellar dust distribution. However there 
are some cirrus galaxies with Li r > L op t, indicating a higher 
dust optical depth. There is also the interesting popula- 
tion of cool luminous galaxies, with Li r > 10 12 Lq, dis- 
cussed by Rowan- Robinson et al. (2005, 2008). The highest 
infrared luminosities are dominated by Arp 220 template 
types, but there are significant numbers of ultraluminous in- 
frared galaxies which are M82 template types. Objects with 
Lopt > 10 12 Lq are Type 1 QSOs and most have Li r < L opt , 
as expected if the infrared emission is dominated by a dust 
torus illuminated by the QSO. 

One of the interesting discoveries of the IRAS Faint 
Source Catalog was the existence of hyperluminous infrared 
galaxies (Rowan-Robinson et al. 1991, Rowan- Robinson 
2001), galaxies with infrared luminosities > 1O 13 L0. In our 
Catalogue we now find 159 hyperluminous infrared galax- 
ies with S60 > 0.2 Jy, of which 38 have S60 > 0.36 Jy, our 
completeness limit. Fig. [TTJ shows the distribution of bolo- 
metric infrared luminosity, Li r , versus redshift, colour-coded 
by the infrared template type. Only galaxies with more than 
one far infrared flux detected are included. In our template 
fitting we do not permit the cirrus template to be used if 
Lir > 10 13 Lq. We see that the highest luminosities are 
dominated by the high optical depth starburst, Arp 220, 
templates, but with some sources dominated by AGN dust 
tori and a few M82 starbursts. 

There are 50 galaxies with Li r > 10 14 Lq, of which 15 
are QSOs, and 8 with L; r > 10 15 Lq. The latter include 
the well known hyperluminous galaxies IRAS F10214+4724 
and 08279+5255, both of which are lensed and are discussed 
in Rowan- Robinson (2001). Five of these eight galaxies have 
spectroscopic redshifts. The hyperluminous infrared galaxies 
in our Catalogue will be discussed in a subsequent paper 
(Rowan- Robinson et al. 2009, in preparation). 

Fig. fT2] illustrates our predicted submillimetre fluxes 
derived from our template fits. It shows the predicted 350 
fim flux versus redshift, again colour-coded by infrared tem- 
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Figure 9. The sky distribution of all galaxies in the IIFSCz (top), galaxies with either spectroscopic or photometric redshift (middle) 
and galaxies without redshift (bottom), in galactic coordinates. The depth of the IIFSCz is affected by variations in the IRAS coverage 
across the sky. Note that in the bottom panel, the redshift completeness is very high in regions covered by the SDSS DR6 survey. 
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Figure 10. Infrared bolometric luminosity (Li r ) versus optical 
bolometric luminosity (L pt), colour-coded by the infrared tem- 
plate type. 




0.2 0.4 0.6 



Figure 11. Infrared bolometric luminosity versus redshift, 
colour-coded by the infrared template type. 



plate type. We have indicated the predicted flux limit of 
the Planck Surveyor all-sky survey. We are predicting that 
over 23000 of the sources in our Catalogue will have 350 ^tm 
fluxes above the Planck all-sky survey limit of 0.3 Jy. Planck 
should detect significantly more sources than this because (i) 
not all the sources in our Catalogue have redshift estimates 
and template fits, (ii) some high redshift infrared galaxies 
may fall below our 60 /im detection limit but still be de- 




z 



Figure 12. The predicted 350 um flux in Jy versus redshift, 
colour-coded by the infrared template type. 



tectable at 350 fim. Our Catalogue should however include 
all sources detected by Planck with z < 0.5. 



7 CONCLUSIONS 

We have presented the Imperial IRAS-FSC Redshift Cat- 
alogue (IIFSCz). It contains 60,303 galaxies selected at 60 
/xm from the IRAS Faint Source Catalog, covering around 
61% of the whole sky. The process of retrieving spectroscopic 
redshifts, multiwavelength cross-matching and photometric 
redshift estimation is described in some detail. In the Cata- 
logue, we give the best possible position, IRAS fluxes, opti- 
cal, near-infrared and/or radio identifications, spectroscopic 
redshift (if available) or photometric redshift (if possible), 
predicted fluxes at wavelengths ranging from 12 to 1380 /im. 
Overall, 32,881 galaxies in the IIFSCz (55%) have received 
spectroscopic redshifts from past redshift surveys such as 
the IRAS PSCz, FSSz and 6dF and around 12,000 galaxies 
(~ 20%) obtained photometric redshifts through either the 
training set (for sources with 2MASS or NVSS photome- 
try) or the template-fitting method (for sources with SDSS 
photometry). At a flux limit of S(60)=0.36 Jy, the redshift 
completeness of the Catalogue, including both spectroscopic 
and photometric redshifts, is increased to 90%. 

The IIFSCz provides a huge data set for large-scale 
structure studies and validations of recent and future in- 
frared and submillimetre surveys (e.g. AKARI, Planck and 
Herschel). Potential users should be aware of issues such as 
the intrinsic IRAS FSC completeness limit, the redshift com- 
pleteness variations across the sky and the varying quality 
of photometric redshift derived for different subsets of the 
Catalogue. 
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